# High-resolution video generation
Cosmos Predict2 2B Text2Image
Other
Cosmos-Predict2 is a series of high-performance pre-trained world foundation models designed to generate physics-aware images, videos, and world states, which can be used for the development of physics AI.
Text-to-Image
C
nvidia
473
19
Wan2.1 T2V 1.3B
Apache-2.0
Wan 2.1 is a comprehensive open-source video foundation model designed to push the boundaries of video generation, supporting tasks such as text-to-video and image-to-video generation.
Text-to-Video Supports Multiple Languages
W
Wan-AI
19.89k
319
Cogvideox1.5 5B I2V
Other
CogVideoX is an open-source video generation model that supports generating videos from images, similar to the Qingying platform.
English
C
THUDM
8,897
102
LTX Video
Other
The first DiT-based video generation model capable of real-time generation of high-quality videos, supporting two scenarios: text-to-video and image + text-to-video.
Text-to-Video English
L
Lightricks
165.42k
1,174
Cogvideox Fun 5b InP
Other
An improved video generation tool based on the CogVideoX architecture, supporting text/image generation of approximately 6-second, 8fps videos
Text-to-Video English
C
alibaba-pai
16
24
Cogvideox Fun 2b InP
Other
A video generation model based on the improved CogVideoX architecture, supporting text/image-to-video and multi-resolution generation
Text-to-Video English
C
alibaba-pai
52
20
Potat1
The first open-source 1024x576 text-to-video model, fine-tuned from a base model
Text-to-Video
P
camenduru
56
159
Featured Recommended AI Models